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1. INTRODUCTION 

Breast cancer is a significant cause of mortality among women, and it is characterized by an 
unchecked growth in the number of irregular cells in the breast. Unfortunately, stopping the growth of breast 
tumors has proven impossible until now [1]. Early detection of breast cancers is critical for accurate 
diagnosis, and this disease is responsible for about 15% of all cancer-related deaths [2]. Therefore, breast 
tissue lumps should be detected as early as possible to maximize the chances of survival. In the early stages, 
tumors can be treated and provide valuable information regarding the emergence of cancer, which can assist 
boost survival rates [3]. Breast biopsy, mammography, ultrasound, thermography, and fine-needle aspiration 
cytology are just a few of the procedures that can be used to diagnose this condition. To now, mammography 
has been widely used for detecting breast cancer; in some cases, this is insufficient, and doctors will require a 
biopsy before making a final decision [4]. Furthermore, the accuracy of the detection rate is low 60% and 
70%. Additional tests, some of them expensive and time-consuming, will be required of the patient [5]. 

Breast cancer detection uses mammograms [6], ultrasounds, and biopsies, as well as machine 
learning (ML) system testing, to find out if someone has the disease [7]. The term biopsies are used to take a 
sample of breast cells for testing. Breast cancer can be diagnosed with a biopsy, a reliable procedure. A 
biopsy test directs using a specialized needle instrument to obtain a sample of questionable tissue during a 
biopsy. Biopsy may be asked to have a small metal marker placed in the breast to aid future imaging tests. 
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Experts in a laboratory examine the cells to see if they are cancerous. It's also important to know what type of 
cells are involved in breast cancer, how aggressive it is (grade), and if the cells have hormones or other 
receptors that could alter your treatment options [8]. After a breast biopsy, the medical decision may take a 
few days. As soon as a biopsy is completed, the breast tissue is sent to the specialized laboratory, then 
analyzed by an expert in the field of blood and body tissue research (pathologist). The pathologist's report 
includes information about the tissue samples’ size, consistency, and location, whether cancerous or non- 
cancerous cells. At times, these experts have different viewpoints, which leads to more surgery necessary to 
harvest additional tissue for further location analysis [9]. Researchers have used ML approaches to overcome 
the drawbacks of typical diagnostic methods, provide a second opinion to physicians, and lower the 
likelihood of human error, which could result in patient death [10]. In decision support systems, ML 
techniques are beneficial in increasing the survival rate because they aid in the automation of the decision- 
making process. Improve precision and speed of response increases and reduce the amount of work that 
doctors have to do, especially when there is a medical personnel shortage [11]. Decision-support and health- 
monitoring systems that use ML approaches can help improve other medical services, such as preventive care 
and mental health treatments for specific individuals [12]. 

This research examined strategies for detecting breast cancer using ML algorithms and Wisconsin 
diagnostic breast cancer (WDBC) dataset: Azar and El-Metwally (2013) [13] used three different types of 
decision tree algorithms. Decision trees are classified into three categories: boosted decision tree (BDT), 
single decision tree (SDT), and decision tree forest (DTF). BDT outperformed SDT across the board. The 
BDT had a sensitivity, accuracy specificity, precision, and area under curve (AUC) of 98.42%, 98.83%, 
99.58%, 99.77%, and 0.99, respectively. Senapati and Dash (2013) [14] implemented an artificial neural 
networks (ANN) algorithm for categorizing breast cancer tumors and suggested the use of the firefly 
algorithm (FA) to enhance the performance of a local linear wavelet neural network. Specifically, FA was 
used to improve the parameters of a local linear wavelet neural network. The approach achieved a 
classification accuracy of 98.14%. Azar and El-Said (2014) [15] described a type of support vector machine 
(SVM) algorithms; they could rapidly, easily, and efficiently diagnose breast cancer. According to the 
training data, Lagrangian SVM has the lowest accuracy 95.6107%, whereas standard SVM beat all other 
techniques in terms of accuracy 97.71%, sensitivity 98.24%, specificity 95.08%, precision 97.39%, and AUC 
0.99. Aliékovic and Subasi (2017) [16] presented data mining techniques with and without genetic algorithm- 
based feature selection to appropriately categorize medical data. The method simplifies computing and 
speeds up data mining. In the second stage, to choose between two groups. Individual and multiple algorithm 
methods were used to develop an accurate system for detecting breast cancer. The accuracy of the random 
forest is 99.48%, and AUC is 0.999. Wang et al. (2018) [17] presented twelve different SVM that are 
hybridized based on weighted area under the curve ensemble (WAUCE) method. The model performance has 
been measured; the experimental findings indicated the model was achieved the highest accuracy 97.10%, 
sensitivity 97.11%, specificity 97.23%, and AUC 0.97. Kumar et al. (2020) [18] focused on implementing 
several classification approaches for data mining and classifying tumors as malignant or benign breast 
cancer. These twelve algorithms had a combined score of 94%. With an accuracy of 73.21%, only Naive 
Bayes has underperformed the others. Algorithms based on random trees and random forests have performed 
remarkably well, with accuracy close to 99%. And the sensitivity, which was found to be 99.10%. 
Lahoura et al. (2021) [19] employed ANN as a algorithm in a system for cloud-based breast cancer detection. 
Comparisons were made between the best extreme learning machine performance results obtained from 
stand-alone systems and cloud solutions. As a result of the tests’ accuracy is 98.68%, precision is 90.54%, 
sensitivity is 91.30%, and the Fl-score is 81.29%. 

The main contribution of this paper is to utilize the capability of ML models and implement them in 
breast cancer diagnosis. The best performance results of ML are compared by finding optimum values for its 
weights. Additionally, almost all models available are compared for finding the optimal, simple, and efficient 
model can breast cancer diagnosis. The rest of this paper is categorized: in section 2, a methodology is 
presented; in section 3, the experimental environment's setup is described; in section 4, the different results 
obtained are discussed in this work; and in section 5, we state the importance of the results, conclusions and 
future work. 


2. METHOD 

The goal of this work was to determine the most effective and predictive algorithm for breast cancer 
diagnosis. We applied ML algorithms decision tree, linear discriminant analysis, forests of randomized trees, 
gradient boosting (GB), passive aggressive (PA), logistic regression (LR), Naive Bayes (NB), nearest 
centroid (NC), and SVM, perceptron to accomplish on the WDBC dataset. Also, we evaluated the results to 
determine which model provides the highest accuracy. The methodology begins with data acquisition and 
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continues with pre-processing, including attribute selection. The pre-processed data is employed to improve 
ML algorithms that predict breast cancer. To assess the algorithms’ performance, the model takes new data 
that has been labeled-typically accomplished by dividing the labeled data collected into two parts. First part, 
split 80% of used data to train ML model, which is referred to as the training data or training set. While 20% 
of the data is utilized to evaluate the model's performance; this is called test data or test set. After evaluating 
the models, we compare the findings to determine which algorithm gives the highest accuracy and which is 
the most predictive for breast cancer detection. 

Predictive analysis of ML techniques is accomplished by using these algorithms: A decision tree 
(DT) is a supervised learning non-parametric approach for classification. The DT consists of many nodes that 
form a rooted tree, which is a tree with no incoming edges and a "root" node. One incoming edge connects 
each of the other nodes. An internal or test node is a node having outgoing edges, and all additional nodes are 
referred to as leaves (decision or terminal nodes). DT is easy to understand and interpret. It's possible to see 
trees in your mind's eye. It is able to work with both numerical and category information [20]. A linear 
discriminant analysis (LDA) is a well-known algorithm featuring linear decision surfaces. This algorithm is 
appealing because they offer closed-form solutions that are simple to calculate, are intrinsically multiclass, 
have been demonstrated to function in reality, and have no hyperparameters to tune [21]. A Forests of 
randomized trees (FRT) are from ensemble learning methods for resolving complex problems by merging 
several algorithms based on DT predictions. The FRT algorithm decides the outcome by averaging the output 
of different trees. As the number of trees grows, the result's precision improves. FRT produces precise 
predictions that are simple to learn. It can successfully handle vast amounts of data. The FRT algorithm is 
more accurate in predicting outcomes than the DT method [22]. A GB refers to an extension of boosting to 
arbitrary differentiable loss functions. GB is a ML approach to improve a model's predictive value by 
continuously improving its predictive value. Many weak learners are combined into one powerful learner in 
GB. Weak learners used are the decision trees for each user. Although learning a boosting algorithm can be 
time-consuming, it is also quite accurate. When it comes to learning algorithms, slower-learning models 
outperform faster learners . Each iteration of the DT changes the values of the coefficients, weights, or biases 
assigned to each of the input variables used to estimate the target value to lower the loss function (the 
measure of the difference between the predicted and actual target values) [23]. A PA is a collection of ML 
techniques. However, it can be quite beneficial and effective in some situations. PA algorithms are 
commonly employed for large-scale learning [24]. It is highly beneficial in cases where there is a large 
quantity of data, and training the whole dataset is computationally impossible due to the sheer volume of the 
data [25]. A LR is a robust ML technique for classifying incoming data and generating probabilities from 
discrete and continuous datasets. LR can quickly identify the most efficient factors for classification and 
categorize observations based on various data sources [26]. LR is used instead of fitting a regression line and 
fitting an "S" shaped logistic function in LR, which predicts two maximum values (1 or 0). The probability 
shows the curve of the logistic function [27]. A NB is a simple algorithm to build and is highly successful 
when dealing with large data. NB is known to outperform even the most powerful classification because it is 
simple and requires fewer data [28]. Determining the class of the test data set is simple and easy when the 
independence threshold is reached [29]. The NC algorithm is a algorithm that assigns labels to a class of 
training samples based on their centroid (mean) distance from the observed object or data. The distances 
between the empirical data and various class centroids are sorted, and the closest distance is chosen [30]. 
Perceptron is a kind of neural network, and it is used for binary classification (two-class) as a linear 
classification approach. It learns a decision boundary in the feature space that uses a line to separate two 
classes (hyperplane). It is excellent for algorithms that can be readily divided using a linear model and is well 
suited for large-scale learning [31]. SVM is a ML algorithm that can generalize across two different classes if 
the method is provided with a set of labeled data in the training set. For the support vector classifier (SVC), 
the most important task is finding a hyperplane that can distinguish between similar and different data 
classes. This model is effective because it can work well in complex three-dimensional environments. The 
method is still effective even when the number of dimensions exceeds the number of samples [32]. 


3. EXPERIMENT ENVIRONMENT 

The WDBC dataset was collected by Walberg et al. from the University of Wisconsin Hospitals in 
Madison [33]. WDBC may be found in the University of California Irvine repository for ML [18]. A data of 
a fine needle aspirate (FNA) of a breast lump is used to compute features; cell nuclei are described in terms 
of the image's features. WDBC was the patient ID, 30 tumor characteristics, and one class indicator included 
in the WDBC dataset, containing 569 cases (62.74% benign, 37.26% malignant) [19]. There were 30 
characteristics in total, including area, texture, radius, perimeter, compactness, smoothness, concavity, 
concave points, and symmetry. The ID number feature was removed because it has no impact on 
classification. The scikit-learn library and the Python language were used for all of the experiments on the 
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ML algorithms (DT, LDA, FRT, GB, LR, PA, Perceptron, NB, NC, and SVC) discussed in this work. It was 
implemented with Python's NumPy, SciPy, pandas, and matplotlib libraries. 


4. RESULTS AND DISCUSSION 

WDEC was classified into benign and malignant tumors using ten classification methods: DT, LDA, 
FRT, GB, LR, PA, Perceptron, NB, NC, and SVC. For each model, we evaluated the performance of models 
from five aspects: specificity, sensitivity, precision, accuracy, Fl-score. Figure 1 provides the results 
obtained after the models implemented specificity. This result is significant at the maximum of 100% for 
LDA, LR, PA, NB, and SVC models. Figure 2 presents the results obtained of sensitivity. It can be seen that 
the Perceptron model had the highest value of 97.87%. 


specificity% 


sensitivity% 
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Figure 1. The specificity of classification models Figure 2. The sensitivity of classification models 


The results of the accuracy analysis can be compared in Figure 3 for all used models. There was a 
significant positive accuracy among almost all models; the results indicate that the GB model had the highest 
accuracy of 97.36%. Figure 4 shows the results obtained for the precision of models. LDA, LR, PA, NB, and 
SVC models had values of 100%, whilst the precision for the perceptron model was the lowest at 82.14%. 
The average scores of specificity and sensitivity can be compared using the Fl-score as shown in Figure 5. It 
is apparent that the GB model had the highest value of 96.77%. The most notable result to observe from the 
results is the efficiency of ML in the prediction and diagnosis of breast cancer. The mean accuracy was 
94.43% for all implemented models except the NB model. We observed GB model achieved the highest 
results with this data, was stable, and its results were high in all respects (Fl-score mainly). 
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Figure 5. Fl-score for classification models 


Furthermore, radiologists are interested in the outcome of the trade-off between sensitivity and 
specificity. Hence, sensitivity refers to a diagnostic's ability to classify patients with cancer as abnormal. In 
contrast, specificity refers to a diagnostic's ability to classify patients who do not have cancer as normal. As 
for ML programmers, they are focused on the receiver operating characteristic (ROC) curve results. The 
ROC curve is created by computing and graphing the true positive rate vs. the false positive rate. Since it 
shows the model's stability and reliability, it reached a maximum value of 0.99 in GB, as illustrated in the 
Figure 6. 
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Figure 6. ROC curve for gradient boosting 


This result may be explained by the fact that GB is an ensemble method that learns from current 
predictor errors to improve future predictions. The technique combines the weak classification model into 
one powerful learner, improving model predictability. By putting weak learners in sequence, weak learners 
learn from the next learner, improving prediction models. Ensemble methods are suitable for minimizing 
model variation and improving prediction accuracy. The variance is decreased when many models are joined 
to generate a single prediction. New predictors are fitted to overcome earlier predictor faults. GB increases 
the model's accuracy by adding predictors sequentially to the ensemble. GB uses the gradient to discover and 
correct errors in learners’ predictions. 

The best results achieved in this work are compared with other results discussed in the literature are 
highlighted in Table 1. These results agree with the findings of other studies in which ML effectively predicts 
the presence of breast cancer. To conclude, the ML models accomplished a higher classification accuracy 
rate, decreased the false positive, and obtained a higher performance rate. As a result of this research, it has 
been shown that the ML is beneficial in assisting a radiologist in making an accurate breast cancer diagnosis 
and that it may form the basis for future work that may eventually be used to assist radiologists. 
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Table 1. Comparison of performance with previous researchers’ results 


Reference Year Machine learning Accuracy Sensitivity Specificity Precision AUC 
technique % %o %o % 
Azar and El-Metwally [13] 2013 BDT 98.83 98.42 99.58 99.77 0.99 
Senapati and Dash [14] 2013 wavelet-ANN 98.14 - - - - 
Azar and El-Said [15] 2014 LPSVM 97.14 98.24 95.08 97.39 0.99 
Aliékovicé and Subasi [16] 2017 Random forest 99.48 - - - 0.99 
Wang et al. [17] 2018 SVM 97.10 97.11 97.23 - 0.97 
WAUCE 
Kumar et al. [18] 2020 FRT 99.14 99.10 - 99.10 - 
DT 99.14 99.10 99.10 
Lahoura et al. [19] 2021 ANN 98.68 91.30 90.54 81.29 - 
Our work GB 97.36 97.87 100 100 0.99 


5. CONCLUSION 

The early identification of breast cancer is one of the most important scientific topics being pursued. 
The purpose of the research was to compare the performance of classification algorithms. The efficiency of 
ML in breast cancer prediction and diagnosis has been recognized to be the most important outcome to 
emerge from these findings. It is clear from this research that the GB algorithm outperforms with WDBC in 
terms of specificity and precision. Because this model is part of the ensemble approaches group, we 
anticipate that the high results will be ascribed to the fact that employing ensembles rather than a single 
predictive model increases predictive modeling performance in this case. These challenges will need to be 
resolved in more future studies. For this reason, future research will pay close attention to assessing ML 
models in additional medical diagnostic issues and optimizing their performance by using high-speed 
computing methods in the process. 
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